The eternal sunshine of the sketch data structure
Authors
Abstract
In recent years there has been significant research on developing compact data structures for summarizing large data streams. One family of such data structures is known as sketches. Sketches bear similarities to the well-known Bloom filters [2] and employ hashing techniques to approximate the count associated with an arbitrary key in a data stream using fixed memory resources. One limitation of sketches is that, when used to summarize long data streams, they gradually saturate, resulting in a potentially large error in the estimated key counts. In this work, we introduce two techniques to address this problem, based on the observation that real-world data streams often contain many transient keys that appear for short time periods and do not reappear later. After entering the data structure, these keys contribute to hash collisions and thus reduce the estimation accuracy of sketches. Our techniques use a limited amount of additional memory to detect transient keys and to periodically remove their hashed values from the sketch. In this manner the number of keys hashed into a sketch decreases, and as a result both the frequency of hash collisions and the estimation error are reduced. Our first technique in effect slows down the saturation process of a sketch, whereas our second technique completely prevents a sketch from saturating. We demonstrate [...] The phrase “eternal sunshine” in the title reflects that our techniques mitigate or halt the saturation process of a sketch.
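The idea of removing transient keys can be illustrated with a minimal sketch. The abstract does not say which sketch variant or detection rule the authors use, so everything here is an assumption: a count-min style sketch stands in for the sketch, and a hypothetical `TransientKeyCleaner` treats a key as transient once it has been idle longer than `idle_limit`. The tracking dictionary is unbounded for simplicity, whereas the abstract describes only a limited amount of additional memory.

```python
import hashlib


class CountMinSketch:
    """Fixed-memory approximate counter (count-min variant, assumed here)."""

    def __init__(self, width=64, depth=4):
        self.width = width
        self.depth = depth
        self.rows = [[0] * width for _ in range(depth)]

    def _index(self, key, row):
        # One hash function per row, derived by salting with the row number.
        salt = row.to_bytes(8, "little")
        digest = hashlib.blake2b(key.encode(), salt=salt).digest()
        return int.from_bytes(digest[:8], "little") % self.width

    def add(self, key, count=1):
        for row in range(self.depth):
            self.rows[row][self._index(key, row)] += count

    def remove(self, key, count):
        # Subtract exactly what this key contributed earlier.
        self.add(key, -count)

    def estimate(self, key):
        # Collisions only inflate counters, so the minimum is the best guess.
        return min(self.rows[r][self._index(key, r)] for r in range(self.depth))


class TransientKeyCleaner:
    """Hypothetical helper: tracks keys and removes those that went idle."""

    def __init__(self, sketch, idle_limit):
        self.sketch = sketch
        self.idle_limit = idle_limit
        self.tracked = {}  # key -> [count added, time last seen]

    def observe(self, key, now):
        self.sketch.add(key)
        entry = self.tracked.setdefault(key, [0, now])
        entry[0] += 1
        entry[1] = now

    def cleanup(self, now):
        # Assumed rule: a key idle for longer than idle_limit is transient.
        idle = [k for k, (_, last) in self.tracked.items()
                if now - last > self.idle_limit]
        for key in idle:
            count, _ = self.tracked.pop(key)
            self.sketch.remove(key, count)
        return idle


sketch = CountMinSketch(width=8, depth=2)
cleaner = TransientKeyCleaner(sketch, idle_limit=10)
for t in range(5):
    cleaner.observe("persistent", now=t * 10)  # seen at times 0, 10, ..., 40
for i in range(50):
    cleaner.observe(f"transient-{i}", now=0)   # each seen once, never again

before = sketch.estimate("persistent")
removed = cleaner.cleanup(now=45)
after = sketch.estimate("persistent")
print(before, after, len(removed))
```

Before the cleanup, the estimate for `"persistent"` can only be inflated by collisions with the transient keys. After the cleanup the sketch holds only that key's own contribution, so the estimate equals its true count of 5.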
Journal title:
- Computer Networks
Volume 52, Issue
Pages -
Publication date: 2008